#GURU dataset 27/06/2025
GURU: Advancing LLM Reasoning Across Six Diverse Domains with Reinforcement Learning
GURU introduces a multi-domain reinforcement learning dataset and models that significantly improve the reasoning abilities of large language models across six diverse domains, outperforming previous open models.